Baldwin, Timothy (2005) The Deep Lexical Acquisition of English Verb-particle Constructions, Computer Speech and Language, Special Issue on Multiword Expressions, Volume 19, Issue 4, pp. 398-414
نویسنده
چکیده
This paper proposes a range of techniques for extracting English verb–particle constructions from raw text corpora, complete with valence information. We propose four basic methods, based on the output of a POS tagger, chunker, chunk grammar and dependency parser, respectively. We then present a combined classifier which we show to consolidate the strengths of the component methods.
منابع مشابه
Uchiyama, Kiyoko, Timothy Baldwin and Shun Ishizaki (2005) Disambiguating Japanese Compound Verbs, Computer Speech and Language, Special Issue on Multiword Expressions, Volume 19, Issue 4, pp. 497-512
The purpose of this study is to disambiguate Japanese compound verbs (JCVs) using two methods: (1) a statistical sense discrimination method based on verbcombinatoric information, which feeds into a first-sense statistical sense disambiguation method, and (2) a manual rule-based sense disambiguation method which draws on argument structure and verb semantics. In evaluation, we found that the ru...
متن کاملDeep lexical acquisition of verb-particle constructions
Using the Pontryagin Duality and results of S. Fisher, P. Gartside (1991); P. Gartside, M. Smith (2007) and Y.D. Cornulier, L. Guyot, W. Pitsch (2008) we show that the space S(A) of all closed subgroups of a compact Abelian group A is countable if and only if there is a closed subgroup K of A such that K is topologically isomorphic to ⊕h i=1 Zpi⊕G and A/K is topologically isomorphic to T ⊕ G̃, w...
متن کاملUsing Distributional Similarity of Multi-way Translations to Predict Multiword Expression Compositionality
We predict the compositionality of multiword expressions using distributional similarity between each component word and the overall expression, based on translations into multiple languages. We evaluate the method over English noun compounds, English verb particle constructions and German noun compounds. We show that the estimation of compositionality is improved when using translations into m...
متن کاملIntroduction to the special issue on multiword expressions: Having a crack at a hard nut
Multiword expressions are an integral part of language. Their heterogeneous characteristics have proved a challenge to both linguistic and computational analysis. Their importance to language technology has long been recognised. In this special issue we include ten papers which propose a variety of approaches for finding and handling these expressions, both for building general purpose lexical ...
متن کاملGet out but don’t fall down: verb-particle constructions in child language
Much has been discussed about the challenges posed by Multiword Expressions (MWEs) given their idiosyncratic, flexible and heterogeneous nature. Nonetheless, children successfully learn to use them and eventually acquire a number of Multiword Expressions comparable to that of simplex words. In this paper we report a wide-coverage investigation of a particular type of MWE: verb-particle construc...
متن کامل